Loop nest optimization

Results: 25



#Item
11Anatomy of High-Performance Many-Threaded Matrix Multiplication Tyler M. Smith∗ , Robert van de Geijn∗ , Mikhail Smelyanskiy† , Jeff R. Hammond‡ and Field G. Van Zee∗ ∗ Institute  for Computational Engineerin

Anatomy of High-Performance Many-Threaded Matrix Multiplication Tyler M. Smith∗ , Robert van de Geijn∗ , Mikhail Smelyanskiy† , Jeff R. Hammond‡ and Field G. Van Zee∗ ∗ Institute for Computational Engineerin

Add to Reading List

Source URL: www.cs.utexas.edu

Language: English - Date: 2014-02-10 15:58:29
12A quad-tree based Sparse BLAS implementation for shared memory parallel computers Michele Martone Universit` a di Roma “Tor Vergata”, Italy

A quad-tree based Sparse BLAS implementation for shared memory parallel computers Michele Martone Universit` a di Roma “Tor Vergata”, Italy

Add to Reading List

Source URL: claudius.ce.uniroma2.it

Language: English - Date: 2013-03-15 13:10:13
13Hierarchical Diagonal Blocking and Precision Reduction Applied to Combinatorial Multigrid∗ Guy E. Blelloch Ioannis Koutis

Hierarchical Diagonal Blocking and Precision Reduction Applied to Combinatorial Multigrid∗ Guy E. Blelloch Ioannis Koutis

Add to Reading List

Source URL: ccom.uprrp.edu

Language: English - Date: 2011-03-02 23:58:23
14Experience in accelerating linear algebra using GPUs Vasily Volkov UC Berkeley October 6, 2011 1

Experience in accelerating linear algebra using GPUs Vasily Volkov UC Berkeley October 6, 2011 1

Add to Reading List

Source URL: wwwsfb.tpi.uni-jena.de

Language: English - Date: 2011-10-10 06:51:23
15Numerical Libraries for Petascale Computing Brett Bode William Gropp  Why Use Libraries?

Numerical Libraries for Petascale Computing Brett Bode William Gropp Why Use Libraries?

Add to Reading List

Source URL: gladiator.ncsa.illinois.edu

Language: English - Date: 2010-05-11 14:28:31
16Design and Evaluation of a Compiler Algorithm for Prefetching Todd C. Mowry, Monica S. Lam and Anoop Gupta Computer Systems Laboratory Stanford University, CA 94305

Design and Evaluation of a Compiler Algorithm for Prefetching Todd C. Mowry, Monica S. Lam and Anoop Gupta Computer Systems Laboratory Stanford University, CA 94305

Add to Reading List

Source URL: www-suif.stanford.edu

Language: English - Date: 2006-06-25 21:11:32
17The New Framework for Loop Nest Optimization in GCC: from Prototyping to Evaluation Sebastian Pop Albert Cohen Pierre Jouvelot Georges-Andr´e Silber CRI, Ecole des mines de Paris, Fontainebleau, France ALCHEMY, INRIA Fu

The New Framework for Loop Nest Optimization in GCC: from Prototyping to Evaluation Sebastian Pop Albert Cohen Pierre Jouvelot Georges-Andr´e Silber CRI, Ecole des mines de Paris, Fontainebleau, France ALCHEMY, INRIA Fu

Add to Reading List

Source URL: www.cri.ensmp.fr

Language: English - Date: 2006-02-10 18:39:21
18The New Framework for Loop Nest Optimization in GCC: from Prototyping to Evaluation Sebastian Pop1 , Albert Cohen2 , Pierre Jouvelot1 , and Georges-Andr´e Silber1 1

The New Framework for Loop Nest Optimization in GCC: from Prototyping to Evaluation Sebastian Pop1 , Albert Cohen2 , Pierre Jouvelot1 , and Georges-Andr´e Silber1 1

Add to Reading List

Source URL: www.cri.ensmp.fr

Language: English - Date: 2005-11-22 06:53:34
19The task ! A modified form of matrix multiplication:  C = f ( A, B )

The task ! A modified form of matrix multiplication: C = f ( A, B )

Add to Reading List

Source URL: www.cs.inf.ethz.ch

Language: English - Date: 2002-02-06 05:39:40